A Feature Selection based Approach for Speaker and Language Identification in VoIP Networks

نویسندگان

  • J J Ranade
  • Indranil Sengupta
چکیده

fulfillment for the award of the degree of Master of Technology (Computer and Information Technology) is a bona-fide record of work carried out by him under our supervision and guidance. The thesis has fulfilled all the requirements as per the regulations of the Institute and, in our opinion, has reached the standard needed for submission. ii Acknowledgements The writing of this thesis has been a long and arduous journey of learning. From the initial concept to the final product, this thesis has been the input from many individuals. I extend my most sincere gratitude to Prof. Indranil Sengupta and Dr. S.K. Ghosh, my supervisors, for their continued support, constructive comments and encouragement. Without their excellent guidance, this work would not have taken the present shape. I feel privileged to have got this opportunity to work with them and learn from their vast academic and research experience. and Political Science for sharing his knowledge and valuable thoughts for the improvements in the basic approach presented in this thesis. Also I would like to Dr. M. Patracca of Politecnico di Torino, Italy for suggesting valuable improvements in few experiments presented in this thesis. Lastly and most importantly, I am thankful and indebted to my organization, the Indian Army, for giving me an opportunity to pursue higher education and research in an elite academic institute. I am grateful for the love and support of my family – my father and my wife, Sukhjeet and my two sons Maitrey and Tej for their love and affection, which kept me motivated throughout my work on this thesis. Without their support, I would not have been able to make it this far. Abstract In VoIP networks, generally the speech is transmitted in the compressed format using some speech compression algorithm, whereas typical automatic speaker or language identification systems are not capable of handling compressed speech. Hence the compressed speech has to be re-synthesized to original waveform in order to get it processed through a speaker or language identification system which involves a considerable computational overhead. Thereby the speaker and language identification systems based on normal speech waveform are not suitable for most of the futuristic speaker or language identification based VoIP applications where the ability to identify a number of speaker or languages in real-time will be essential. In this thesis, we propose an on-line speaker and language identification scheme where Dimensionally Reduced Significant Statistical (DRSS) …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Comparative Study of Gender and Age Classification in Speech Signals

Accurate gender classification is useful in speech and speaker recognition as well as speech emotion classification, because a better performance has been reported when separate acoustic models are employed for males and females. Gender classification is also apparent in face recognition, video summarization, human-robot interaction, etc. Although gender classification is rather mature in a...

متن کامل

AN INTELLIGENT FAULT DIAGNOSIS APPROACH FOR GEARS AND BEARINGS BASED ON WAVELET TRANSFORM AS A PREPROCESSOR AND ARTIFICIAL NEURAL NETWORKS

In this paper, a fault diagnosis system based on discrete wavelet transform (DWT) and artificial neural networks (ANNs) is designed to diagnose different types of fault in gears and bearings. DWT is an advanced signal-processing technique for fault detection and identification. Five features of wavelet transform RMS, crest factor, kurtosis, standard deviation and skewness of discrete wavelet co...

متن کامل

A General Investigation on the Combination of Local and Global Feature Selection Methods for Request Identification in Telegram

Nowadays, the use of various messaging services is expanding worldwide with the rapid development of Internet technologies. Telegram is a cloud-based open-source text messaging service. According to the US Securities and Exchange Commission and based on the statistics given for October 2019 to present, 300 million people worldwide used telegram per month. Telegram users are more concentrated in...

متن کامل

Determining Effective Features for Face Detection Using a Hybrid Feature Approach

Detecting faces in cluttered backgrounds and real world has remained as an unsolved problem yet. In this paper, by using composition of some kind of independent features and one of the most common appearance based approaches, and multilayered perceptron (MLP) neural networks, not only some questions have been answered, but also the designed system achieved better performance rather than the pre...

متن کامل

Effective Feature Selection for Pre-Cancerous Cervix Lesions Using Artificial Neural Networks

Since most common form of cervical cancer starts with pre-cancerous changes, a flawless detection of these changes becomes an important issue to prevent and treat the cervix cancer. There are 2 ways to stop this disease from developing. One way is to find and treat pre-cancers before they become true cancers, and the other is to prevent the pre-cancers in the first place. The presented approach...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008